Goto

Collaborating Authors

 ieee international conference




M4Singer: AMulti-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus

Neural Information Processing Systems

The lack of publicly available high-quality and accurately labeled datasets has long been a major bottleneck for singing voice synthesis (SVS). To tackle this problem, we present M4Singer, a free-to-use Multi-style, Multi-singer Mandarin singing collection with elaborately annotated Musical scores as well as its benchmarks. Specifically, 1) we construct and release a large high-quality Chinese singing voice corpus, which is recorded by 20 professional singers, covering 700 Chinese pop songs as well as all the four SATB types (i.e., soprano, alto, tenor, and bass); 2) we take extensive efforts to manually compose the musical scores for each recorded song, which is necessary to the study of the prosody modeling for SVS. 3) To facilitate the use and demonstrate the quality of M4Singer, we conduct four different benchmark experiments: score-based SVS, controllable singing voice (CSV), singing voice conversion (SVC) and automatic music transcription (AMT). Audio samples can be found at http://m4singer.github.io.


1dc2fe8d9ae956616f86bab3ce5edc59-Supplemental-Conference.pdf

Neural Information Processing Systems

We construct SEIDNet based on PyTorch1. There are 26 convolutional layers for extracting the visual feature map from the rainy image. The feature masking contains two convolutional layers. It computes the rain (or object) feature map. There is a pair of batch normalization and ReLU layers between the adjacent convolutional layers. The size of kernels in each convolutional layer is 3 3. Vid generates 3 3kernel for deraining each pixel.


Flare7K: APhenomenological Nighttime Flare Removal Dataset

Neural Information Processing Systems

Artificial lights commonly leave strong lens flare artifacts on images captured at night. Nighttime flare not only affects the visual quality but also degrades the performance of vision algorithms. Existing flare removal methods mainly focus on removing daytime flares and fail in nighttime. Nighttime flare removal is challenging because of the unique luminance and spectrum of artificial lights and the diverse patterns and image degradation of the flares captured at night. The scarcity of nighttime flare removal datasets limits the research on this crucial task.


Theseus: ALibrary for Differentiable Nonlinear Optimization Appendix AContributions

Neural Information Processing Systems

The contributions of the authors are as follows. Luis Pineda led the engineering of the project, developed and implemented the core API, differentiable nonlinear solvers, motion planning example and tutorials, standard and autodiff cost functions, and backward mode experiments, coordinated with sub-teams to help design, implement, integrate and review of all aspects of the code and evaluations, wrote the paper.


Habitat 2.0: Training Home Assistants to Rearrange their Habitat

Neural Information Processing Systems

We introduce Habitat 2.0 (H2.0), a simulation platform for training virtual robots in interactive 3D environments and complex physics-enabled scenarios. We make comprehensive contributions to all levels of the embodied AI stack - data, simulation, and benchmark tasks.